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Abstract 

Caching is a technique to reduce peak traffic rates by prefetching popular content in memories at the end users. 
This paper proposes a novel caching approach that can achieve a significantly larger reduction in peak rate compared 
to previously known caching schemes. In particular, the improvement can be on the order of the number of end 
users in the network. Conventionally, cache memories are exploited by delivering requested contents in part locally 
rather than through the network. The gain offered by this approach, which we term local caching gain, depends 
on the local cache size (i.e, the cache available at each individual user). In this paper, we introduce and exploit 
a second, global, caching gain, which is not utilized by conventional caching schemes. This gain depends on the 
aggregate global cache size (i.e., the cumulative cache available at all users), even though there is no cooperation 
among the caches. 

To evaluate and isolate these two gains, we introduce a new, information-theoretic formulation of the caching 
problem focusing on its basic structure. For this setting, the proposed scheme exploits both local and global caching 
gains, leading to a multiplicative improvement in the peak rate compared to previously known schemes. Moreover, 
we argue that the performance of the proposed scheme is within a constant factor from the information-theoretic 
optimum for all values of the problem parameters. 
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I. Introduction 

The high temporal variability of network traffic results in communication systems that are congested 
during peak- traffic times and underutilized during off-peak times. One approach to reduce peak traffic is to 
£ — ; take advantage of memories distributed across the network (at end users, servers, routers, . . . ) to duplicate 
O ■ content. This duplication of content, called content placement or caching, is performed during off-peak 
^ ! hours when network resources are abundant. During peak hours, when network resources are scarce, user 
^ \ requests can then be served from these caches, reducing network congestion. In this manner, caching 
<3 ■ effectively allows to shift traffic from peak to off-peak hours, thereby smoothing out traffic variability and 
CN ■ reducing congestion. 

From the above discussion, we can see that the caching problem consists of two distinct phases. The 
first phase is the placement phase, which is based solely on the statistics of the user demands. In this 
^ ■ phase, the network is not congested, and the main limitation is the size of the cache memories. The second 
phase is the delivery phase, which is performed once the actual demands of the users have been revealed. 
In this phase, the network is congested, and the main limitation is the rate required to serve the requested 
content. 

Various versions of this problem have been studied, with the focus being mainly on exploiting the 
history or statistics of the user demands Hl-El. In these papers, the operation of the delivery phase is 
fixed to consist of simple orthogonal unicast or multicast transmissions. Assuming this method of delivery, 
the cache placement is then optimized. The gain of caching in this approach results from making popular 
content available locally. In another line of research, the objective is to optimize the delivery phase for 
fixed known cache contents and for specific demands 10, J9]|. 

As pointed out above, the gain from traditional caching approaches derives from making content 
available locally: if a user requests some content that is stored in its cache, it can be served from its 
local memory. We hence call this the local caching gain. This gain is relevant if the local cache memory 
is large enough such that a sizable fraction of the total (popular) content can be stored locally. On the 
other hand, if the size of the local caches is small compared to the total amount of content, then this gain 
is insignificant. 
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In this paper, we propose a novel caching approach that, in addition to the local caching gain, is able 
to achieve a global caching gain. This gain derives from jointly optimizing both the caching and delivery 
phases, ensuring that in the delivery phase several different demands can be satisfied with a single multicast 
transmission. Since the cache placement is performed without knowledge of the actual demands, in order 
to achieve this gain the placement must be carefully designed such that these multicasting opportunities 
are created simultaneously for all possible requests in the delivery phase. We show that this global caching 
gain is relevant if the aggregate global cache size is large enough compared to the total amount of content. 
Thus, even though the caches cannot cooperate, the sum of the cache sizes becomes an important system 
parameter. 
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(a) Caching performance for N = 50 files and K = 50 users. For a 
memory able to store M = 10 files, the rate required by the proposed 
scheme is a factor 11 smaller than the rate required by conventional 
schemes. 



(b) Caching performance for N = 500 files and K = 500 users. 
For a memory able to store M — 100 files, the rate required by the 
proposed scheme is a factor 101 smaller than the rate required by 
conventional schemes. 



Fig. 1. Rate R required in the delivery phase as a function of memory size M. The dashed green curve shows the performance of 
conventional caching strategies. The solid blue curve is the performance of the proposed new caching scheme. 



To formally analyze the performance of the proposed caching scheme, and in order to evaluate and 
isolate these two gains, we introduce a new, information-theoretic formulation of the caching problem 
focusing on its basic structure. In our setting, K users are connected to a server through a shared, error- 
free link. The server has a database of N files of equal size and equal popularity. Each of the users has 
access to a cache memory big enough to store M of the files. During the placement phase, the caches 
are filled as a function of the database. During the delivery phase, each user may ask for any one of the 
N possible files. The objective is to design the placement and delivery phases such that the rate in the 
delivery phase is minimized. For simplicity, we restrict the discussion in the introduction section to the 
most relevant case, in which the number of files N is larger than the number of users K. The main results 
are presented later for the general case. 

In the above setting, the rate in the delivery phase without any caching is equal to K files. The two 
caching gains mentioned above are shown to be as follows: 

• Local caching gain: The rate is reduced by a factor 1 — M/N. This gain depends on the normalized 
local cache size M/N . 

• Global caching gain: The rate is reduced by a factor 1+I ^ M ^ N • This gain depends on the normalized 
cumulative cache size KM/N. 

Observe that the local caching gain is relevant only if the cache size M is on the order of the number of 
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files N. On the other hand, the global caching gain is relevant whenever the cumulative cache size KM 
is on the order of (or larger than) the number of files N. 

As pointed out earlier, conventional caching solutions exploit only the local caching gain. In this setting, 
the rate in the delivery phase of conventional caching schemes is 

K ■ (1-M/N). 

On the other hand, the new caching scheme proposed in this paper attains both the local and the global 
caching gains. The rate in the delivery phase of the proposed scheme is 

K-(l- M/N) ' 



1 + KM/N 

Moreover, by deriving fundamental lower bounds on the rate required in the delivery phase, we show that 
this rate is within a factor 12 of the information-theoretic optimum for all values of N, K, and M. 

To obtain some intuition about the effect of these two gains, let us compare the rates of the conventional 
scheme (achieving only the local gain) versus the proposed scheme (achieving both the local and global 



gains) for a system with N = 50 files and K = 50 users as shown in Fig. |l(a)[ When each user has a 
cache memory large enough to store M = 10 files, the rate of the conventional scheme corresponds to 
sending 40 files over the shared link, while the proposed scheme achieves a rate corresponding to sending 
only 3.6 files: a reduction by a factor 11 in rate. 

The gain of the proposed scheme compared to conventional strategies is even more pronounced for 
larger values of iV and K, as can be seen in Fig. |l(b)| for the scenario with 500 files and users. For 



general N and K, the performance gain of the proposed scheme over conventional caching strategies can 
be up to a factor of order min{iV, K}. 

The remainder of this paper is organized as follows. Section [XT] formally introduces our information- 
theoretic formulation of the caching problem. Section [HI] presents main results, which are illustrated with 
examples in Section lIVl Sections [VT - tVIII contain proofs. 

II. Problem Setting 

Before formally introducing the problem in Section III-BL we start with an informal description in 
Section UTAl 



A. Informal Problem Description 

We consider a system with one server connected through a shared, error-free link to K users. The 
server has access to a database of N files W\,..., W N each of size F bits. Each user k has an isolated 
cache memory Z k of size MF bits for some real number M £ [0, N]. 

The system operates in two phases: a placement phase and a delivery phase. In the placement phase, 
the users are given access to the entire database W\, . . . , Wn of files. Each users k is then able to fill 
the content of its cache Z k using the database. In the delivery phase, only the server has access to the 
database of files. Each user k requests one of the files Wd k in the database. The server is informed of 
these requests and proceeds by transmitting a signal Xr dl d k ) °f s i ze RF bits over the shared link for 
some fixed real number R. Using the content Z k of its cache and the signal X^,...,^) received over the 
shared link, each user k aims to reconstruct its requested file Wd k - 

We say that a memory-rate pair (M, R) is achievable for requests di, . . . ,dx if every user k is able to 
recover its desired file W k (with high probability). A memory-rate pair (M, R) is said to be achievable if 
this pair is achievable for every possible request di, . . . , dx in the delivery phase. Finally, we denote by 
R*(M) the smallest rate R such that (M,R) is achievable. The function R k (M) describes the memory- 
rate tradeoff for the caching problem. The aim of this paper is to characterize this memory-rate tradeoff. 
In other words, we aim to find the minimum rate of communication over the shared link at which all 
possible demand tuples can be satisfied. 
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We illustrate these definitions with the example of the caching strategy employed by conventional 
caching systems. 
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(a) Original problem. 



(b) Compact representation. 



Fig. 2. The caching problem with N = 2 files and i(" = 2 users. Zi and are caches of size MF bits. Xr^^) 
RF bits sent over the shared link responding to user one and two requesting files Wd 1 and Wd 2 , respectively. 



is the signal of size 



Example 1 (Conventional Scheme). Consider the caching problem with N = 2 files W\, W 2 and K = 2 
users as depicted in Fig. [2l In the placement phase, users one and two fill the contents Z\ and Z 2 of their 
respective caches as a function of both files W\ and W 2 . In the delivery phase, each user requests a file, 
say user one request W 2 and user two requests W\. As a response to these requests, the server transmits 
X(2 y \) over the shared link. Using only Z\ and -XWi), user recovers an estimate W 2 of the desired message 
W 2 . Similarly, using only Z 2 and Xni), user two recovers an estimate W\ of the desired message W\. 

For concreteness, assume that the caches have size F so that M — 1. One possible caching strategy is to 
place an equal fraction of each of the iV files in the cache. In this example, each of the two caches Z\ and 
Z 2 would then consist of the first halves of the files W\ and W 2 . In the delivery phase, the server simply 
transmits the second halves of the two files. This requires F bits to be sent so that R — 1. Clearly, from 
the content of their caches and the signal sent over the shared link, the users can recover the requested 
files. 

We refer to the caching strategy described in the last paragraph as the conventional caching scheme, 
since it is the strategy employed by traditional caching systems. For general N, K, and M, in the 
conventional scheme each user caches the same M/N fraction of each file. In the delivery phase, the 
remaining 1 — M/N fraction of any requested file needs to be transmitted over the shared link. Therefore, 
the delivery rate in the conventional scheme is 

mm{N,K} ■ (1- M/N). 

As we shall see in the following, this conventional caching strategy can be significantly improved upon, 
especially for large values of K and N. 



B. Formal Problem Statement 

We now provide the formal definition of the information-theoretic caching problem. Let (W n )^ =1 be 
iV independent random variables each uniformly distributed over 

[2 F ]4{l, 2 ,...,2 F } 

for some F G N. Each W n represents a file of size F bits. A (M, R) caching scheme consists of K 
caching functions, N K encoding functions, and KN K decoding functions. 
The K caching functions 

[2T-+[2L™J] 
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map the files W±, . . . , W N into the cache content 

z k ±<t> k {w u ...w N ) 

for each user k e [K] during the placement phase. The N K encoding functions 

4> idl ,..., dK y. [2 F ] N ^ [2 [FR1 ] 

map the files W\, . . . , Wn to the input 

X( dl ,...,d K ) - ^(di ,-4k) ( W h W N ) 

of the shared link responding to the requests (di, . . . , d K ) e [N] K during the delivery phase. Finally, the 
KN K decoding functions 

^*^*r)^: [2 lJUFJ ] X pL'^J] [S^ 
map the signal received over the shared link X{d u ...,d K ) and the cache content Z k to the estimate 

W( dl ,...,d K ),k — fJ'(di,...,d K ),k{X(d 1 ,...,d K )^ Z k) 

of the requested file Wd k of user k e [K]. The probability of error is defined as 

max maxP(W( dl djr ) fc ^ W dk ). 

Definition. The pair (M, i?) is achievable if for every e > and every large enough file size F there 
exists a (M, i?) caching scheme with probability of error less than e. We define the memory-rate tradeoff 

R*(M) = inf {R : (M, i?) is achievable}. 



III. Main Results 



For ease of exposition, we first consider the case N > K. The next result presents an achievable rate, 
yielding an upper bound on the memory-rate tradeoff R*(M). 

Theorem l.A. For N e N files and K E {1,2,..., iV} users each with cache of size 
MGf-{0,l,..,4 

R*(M) < R(M) = K ■ (1 - M/N) ' 



1 + KM/N 

is achievable. For general < M < N, the lower convex envelope of these points is achievable. 

The caching algorithm achieving R(M) is described in Section |Vj The rate R(M) in Theorem ll.AI is 
composed of three distinct factors. Observe that in the absence of caches at the users (i.e., M = 0), the 
worst-case rate with unicast delivery is K. This is the first factor in R(M). 

Recall from Example Q] in Section III- A I that the conventional caching scheme achieves a rate of 

K-{l-M/N). (1) 

Therefore, the conventional scheme offers a caching gain of a factor 1 — M/N, which we call the local 
caching gain. This local caching gain is the second factor in R(M). Observe that this local gain is a 
function of the normalized local memory size M/N and is relevant whenever M is on the order of N. 

Finally, the third factor in R(M) is the global caching gain 1+I ^ M / N - This gain is a function of the 
normalized global or cumulative memory size KM/N and is relevant whenever KM is on the order of 
(or larger than) N. This global gain is to be interpreted as a simultaneous multicasting gain available 
for all possible demands. Note that, since the number of users is smaller than the number of files, in 
the worst case all users request different files. Hence, when N > K, there are no natural multicasting 
opportunities. The scheme proposed in Theorem ll.AI carefully designs the cache placement in order to 
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create multicasting opportunities in the delivery phase even among users that request different files. Since 
the placement phase is performed without knowledge of the actual demands, care must be taken to ensure 
that the same multicasting opportunities are created simultaneously for every possible set of requests in 
the delivery phase. 

We point out that the conventional caching scheme introduced in Example CD achieves only the local 
caching gain, whereas the caching strategy proposed in Theorem 1 1 . A| achieves both the local as well as 
the global caching gains. The following two examples compare these two gains. 

Example 2 (Q(K) Improvement in Rate). Consider a situation with the same number of users as files, 
i.e., N = K. Assume each user has enough local cache memory for half of the files so that M = N/2. 
Then the local caching gain is 1/2 and the global caching gain is 1/(1 + K/2). By (Q~|), the conventional 
scheme achieves a rate of K/2. On the other hand, by Theorem ll.Al the scheme proposed in this paper 
achieves a rate of (K/2)/(l + K/2) < 1: a reduction by more than a factor K/2 in rate compared to the 
conventional scheme. We refer the reader to Fig. \T\ in Section U for a visualization of the effect of this 
improvement. 

Example 3 (Q(K) Improvement in Slope). In this example, we compare the performance of the proposed 
and conventional schemes for small values of the local cache size M. We consider again the case N = K. 
From ([T]), the slope of the rate of the conventional scheme as a function of cache size M around M = 
is equal to —1. On the other hand, by Theorem ll.Al the scheme proposed in this paper has a slope of less 
than —K/2 around M = 0. Therefore, the rate of the proposed scheme reduces with the local cache size 
at least K/2 times faster than the conventional scheme. Comparing the rates of conventional and proposed 
scheme in Fig. \T\ in Section U for small values of M illustrates the effect of this improvement. 

For the case N < K, the achievable rate is stated in the following theorem. 

Theorem l.B. For N e N files and K e {N + 1, N + 2, . . . } users each with cache of size 



is achievable. For general < M < N, the lower convex envelope of these points is achievable. 

The proof of Theorem I l.BI is reported in Section |V] Again, the rate R(M) in Theorem I l.BI is composed 
of three factors. The first two factors are the same as for the case N > K (worst-case rate with unicast 
delivery K, local caching gain 1 — M/N). 

However, the third factor (the global gain) is different. For N < K, this gain is the minimum of two 
terms. The first term 1+I ^ M ^ N is the same as in Theorem ll.Al for N > K. Recall that this global gain is a 
simultaneous multicasting gain created by careful cache placement to allow multicasting opportunities for 
all possible requests in the delivery phase. However, for a situation with fewer files than users, there exists 
already a natural multicasting opportunity: by multicasting all iV files to the K users, we can achieve a 
gain of N/K. The scheme in Theorem 1 1 .Bl achieves the minimum of these two gains. We point out that 
for M > 1 — N/K this minimum is achieved by the first of the two gains. In other words, the natural 
multicasting gain is relevant only when the memory size is very small. 

Having established an upper bound on the memory-rate tradeoff R*(M), we proceed with a lower 
bound on it. 

Theorem 2. For N E N files and K 6 N users each with cache of size < M < N, 



The proof of Theorem [2l presented in Section |VT1 is based on a cut-set bound argument. While tighter 
lower bounds on R*(M) can be derived using stronger arguments than the cut-set bound (see the discussion 
in Example @] in Section HVl) . these are not required for the purposes of this paper. 
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Comparing the achievable rate R(M) in Theorem 1 with the lower bound in Theorem [21 we obtain the 
following approximation result for the memory -rate tradeoff R k (M). 

Theorem 3. For N E N files and K e N users each with cache of size < M < N, 

R(M) 

with the achievable rate R(M) as defined in Theorem 1. 

The proof of Theorem [3] is presented in Section [VTll The bound R(M)/R*(M) < 12 on the approxi- 
mation ratio of the proposed caching scheme is somewhat loose due to the analytical bounding techniques 
that were used. Numerical simulations suggest that 

gW < 5 
R*(M) ~ 

for all N, K, and < M < N. 

Theorem [3] shows that the rate R(M) of the proposed scheme in Theorem 1 is close to the information- 
theoretic optimum R*(M) for all values of the system parameters. More precisely, it shows that no scheme 
can improve upon the rate R(M) of the proposed scheme by more than a factor 12. This also implies that 
the local and global caching gains identified in this paper are fundamental: there are no other significant 
caching gains beyond these two. 



IV. Examples 




Fig. 3. Memory-rate tradeoff for N = 2 files K — 2 users. The achievable rate R(M) from Theorem 1 is indicated by the solid blue 
curve. The lower bound on R* (M) from Theorem [2] is indicated by the dashed red curve. For the N — K = 2 case, R*(M) can be found 
exactly and is indicated by the dotted black curve. 



Example 4. Consider the case N = K = 2, so that there are two files, say W% = A, W2 = B, and two 
users each with cache memory of size M. The upper and lower bounds in Theorems 1 and [2] on the 
memory-rate tradeoff R*(M) are depicted in Fig. [3] To illustrate the proof techniques, we now show how 
these two bounds are derived for this simple setting. 

We start with the upper bound in Theorem 1 . First, let us consider the two extreme cases M = and 
M — N. If M — 0, the server can always transmit both files A and B over the shared link. Since this 
satisfies every possible request, the (M, R) pair (0, 2) is achievable. If M = 2, each user can cache both 
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shared link: 



user 2: 
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Fig. 4. Caching strategy for N — 2 files and K = 2 users with cache size M = 1. Each file is split into two subfiles of size 1/2, i.e., 
A — (AijAz) and B = (B\,B<i). The scheme achieves rate R = 1/2. 



files A and B in the placement phase. Therefore, no communication is needed in the delivery phase and 
the (M, R) pair (2, 0) is achievable. 

Consider then a cache size M = 1. The caching scheme achieving the upper bound in Theorem 1 is as 
follows (see Fig. |4j. We split both files A and B into two subfiles of equal size, i.e., A = (A 1 , A 2 ) and 
B = (Bi,B 2 ). In the placement phase, we set Z\ = (Ai,Bi) and Z 2 = (A 2 ,B 2 ). In words, each user 
caches one exclusive part of each file. For the delivery phase, assume for example that user one requests 
file A and user two requests file B. Given that user one already has subfile A\ of A, it only needs to 
obtain the missing subfile A 2 , which is cached in the second user's memory Z 2 . Similarly, user two only 
needs to obtain the missing subfile B\, which is cached in the first user's memory Z\. In other words, 
each user has one subfile of the file that the other user needs. 

The server can in this case simply transmit Xn 2 ) = A 2 © B\, where © denotes bitwise XOR. Since 
user one already has B\, it can recover A 2 from A 2 (B B 1 . Similarly, since user two already has A 2 , it 
can recover Bi from A 2 © B\. Thus, the signal A 2 © B\ received over the shared link helps both users 
to effectively exchange the missing subfiles available in the cache of the other user. 

The signals sent over the shared link for all other requests are depicted in Fig. HI One can see that in 
all cases the signal is constructed using the same logic of exchanging the missing subfile. This proves the 
achievability of the (M, R) pair (1,1/2). 

So far, we have shown that the (M,R) pairs (0,2), (1,1/2), and (2,0) are achievable. On the other 
hand, it is easy to see that if any two points (Mi, Ri) and (M 2 , R 2 ) are achievable, the line connecting 
these two points is also achievable. This can be shown by dividing the cache memories and the transmitted 
signal proportionally between the two caching schemes. Together, this proves the achievability of the solid 
blue curve in Fig. |3j which coincides with the upper bound stated in Theorem 1. 

We continue by analyzing the lower bound on R*(M) in Theorem [2] The proof relies on the cut-set 
bound. We consider two cuts. Referring to Fig. |2(b)| in Section HH the first cut separates (X^i j2 ), Z 1 , Z 2 ) 
from (K 7 !,^)- Assume (M,R) is an achievable memory-rate pair. Then this cut has capacity at most 
RF + 2MF, since -XV 2 ) is at most RF bits by definition of achievability, and since Z\, Z 2 contain each 
MF bits. On the other hand, since W\ can be recovered from (X( 12 ), Z{) and W 2 can be recovered from 
(X(j 2 ), Z 2 ), the number of bits that need to be transmitted over this cut is at least 2F. Hence, 

RF + IMF > 2F, 

so that 

R > 2 - 2M. 

As this holds for all achievable memory -rate pairs (M,R), we conclude that 

R*(M) > 2 - 2M. 
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The second cut separates (X( lj2 ),X( 2 ,i), Zi) from (Wi,W 2 ). This cut yields 

2RF + MF > 2F 
for any achievable memory-rate pair (M,R), implying that 

R*(M) > 1 - M/2. 

Together, this yields the dashed red curve in Fig. [3l which coincides with the lower bound stated in 
Theorem |2l 

For the case N = K = 2, the memory-rate tradeoff can in fact be found exactly and is indicated by the 
dotted black curve in Fig. [3j This is argued by showing that the pair (M, R) = (1/2, 1) is also achievable, 
and by deriving the additional non cut-set bound 

R*(M) > 3/2 - 3M/2. 

This shows that, while the bounds in Theorems 1 and [2] are sufficient to characterize the memory-rate 
tradeoff R*(M) to within a bounded multiplicative gap, neither the achievable region nor the converse 
are tight in general. The details of this derivation are reported in the Appendix. 




Fig. 5. Memory-rate tradeoff for N — 3 files and K — 3 users. The achievable rate R(M) from Theorem 1 is indicated by the solid blue 
curve. The lower bound on R* (M) from Theorem [2] is indicated by the dashed red curve. 

Example 5. In this example, we assume that iV = K = 3 so that there are three users and three files, 
say Wi = A,W 2 = B, and W 3 = C. Again, it is trivial to see that the (M, R) pairs (0, 3) and (3, 0) are 
achievable. We focus on two nontrivial cases M = 1 and M = 2. 

Consider first caches of size M = 1. We split each file into three subfiles of equal size, i.e., A = 
(Ai, A 2 , A 3 ), B = (Bi, B 2 , B 3 ), and C = (Ci, C 2 , C 3 ). In the placement phase, the cache content of user 
k is selected as = (Ak, Bk, Ck). A more formal way to describe this cache placement, which is a bit 
exaggerated for this simple setting but will be useful for the general setting below, is as follows. Let T 
be a subset of one element of {1, 2, 3}. Then subfiles A T , B T , Cj- are placed into the cache of user k if 
k eT. For example, A\ is cached at user one since in this case T = {1}. 

For the delivery phase, let us consider as an example that user one requests file A, user two requests 
file B, and user three requests file C. Then the missing subfiles are A 2 and A 3 for user one, B\ and B 3 
for user two, and G\ and C 2 for user three. Given the cache contents, users one and two aim to exchange 
A 2 and B x , users one and three aim to exchange A 3 and Ci, and users two and three aim to exchange 
B 3 and C 2 . The signal -XVl 2,3) = (A 2 © Bi, A3 © Ci, B 3 © C 2 ) enables all of these three exchanges. All 
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other requests can be satisfied in a similar manner. Since each subfile has rate 1/3, the proposed scheme 
achieves a rate of 1, and therefore (M, R) pair (1, 1) is achievable. 

Consider next caches of size M = 2. In this case, each file is again split into three subfiles of equal 
size. However, it will be convenient to label these subfiles differently, namely A = (Ai 2 , A13, A23), 
B = (B12, Bi 3 , B23), and C = (C 12 , C 13 , C23). In the placement phase, the caching strategy is 

Z\ = (A12, A13, B12, Bis, C12, C13), 
Z 2 = (A12, A23, B12, B 2 3, C12, C23), 

Z3 = (A13, A23, B13, B23, C13, C23). 

This cache placement can be understood as follows. Let T be a subset of two elements of {1, 2, 3}. Then 
subfiles At, Br, Cr are placed into the cache of user k if k e T. For example, Ai 3 is cached at users 
one and three since in this case T = {1,3}. 

For the delivery phase, let us again assume as an example that user one requests file A, user two 
requests file B, and user three requests file C . In this case, user one misses subfile A23, which is available 
at both users two and three. User two misses subfile B13, which is available at both users one and three. 
And user three misses subfile C12, which is available at both users one and two. In other words, the three 
users would like to exchange the subfiles A23, Bi 3 , Ci 2 . This exchange can be enabled by transmitting the 
signal X/12,3) = A23 © -B13 © C\ 2 over the shared link. Given its cache content, each user can then recover 
the missing subfile. All other requests can be satisfied in a similar manner. The rate of transmission in 
the delivery phase is 1/3, and therefore (M,R) pair (2, 1/3) is achievable. 

The arguments so far show that the solid blue curve in Fig. [5] is achievable. This coincides with the 
upper bound in Theorem 1 . 

For the lower bound on R*(M), we use two cut-set bounds. The first cut separates (X( 1)2| 3), Zi, Z 2 , Z 3 ) 
from (A,B,C). This is a valid cut since (X(i )2)3 ), Zi), (X(i )2j3 ), Z 2 ), and pT(i )2j3 ), Z 3 ) determine A, B, 
and C, respectively. For any achievable (M, R) pair, the capacity of this cut is at most RF + 3MF, and 
the number of bits that need to be transmitted over it is at least 3F. Hence, 

RF + 3MF > 3F, 

which implies that 

R*(M) > 3 - 3M. 

The second cut separates (X(i )2)3 ), X( 3j i )2 ), X(2,3,i), Z\) from (A, B, C). This is a valid cut since (X(i j2)3 ), Zi), 
(X( 31j 2), Z\), and (Xr 2 ,3,i), Zi) determine A, C, and B, respectively. For any achievable (M, R) pair, the 
cut capacity is at most 3RF + MF, and the number of bits that need to be transmitted over it is at least 
3F. Hence, 

3RF + MF > 3F, 

which implies that 

R*(M) > 1 - M/3. 

Together, these two cut-set bounds result in the dashed red curve in Fig. [5J This coincides with the lower 
bound in Theorem [2l 

V. An Order-Optimal Caching Strategy (Proof of Theorem 1) 
We now present the general achievable scheme. To simplify the description of the algorithm, the notation 

[N] ^ {!,..., AT}, 

[*]={!, •••,#} 
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is used. We first describe the algorithm in words. Consider cache size MgI'{0,1,..., K}, and set 

t = MK/N. 

Observe that t is an integer between and K. 

If M = 0, then in the delivery phase the server can simply transmit the union of all requested files 
over the shared link, resulting in FminjiV, K} bits to be sent. Hence 

R*(0) < min{N,K}. 

If M = N, then all files in the database can be cached at every user in the placement phase. Hence 

R*(N) = 0. 

Assume in the following that M is strictly larger than zero and strictly less than N, so that t G 
{1,2,...,K-1}. 

In the placement phase, each file is split into (^) nonoverlapping subfiles of equal size. It will be 
convenient to label the subfiles of file W n as 

W n = {W n , T :Tc[K\,\T\=t). 

For each n E [N], subfile W n j is placed in the cache of user k if k £ T. Thus, each user caches a total 
of N(~}) subfiles. Since each of these subfiles has size Fj ( ), this requires 

Nf*- 1 ) — = F— = FM 

iV \t-\ i (K-^ K 

bits of cache memory at each user, satisfying the memory constraint. 

Example 6. Let iV = K = 3 and M = 2. Then t = 2 and the cache placement is 

Zi = (W n , {lt2} ,W n , {l)3} f n=1 , 

z 2 = (w nt{m ,w n ^ 3} )l =1 , 
z 3 = (w n>{l>3} ,w ni{m ) 3 n=l , 

as we have already seen in Example [5] in Section [IV] 

We next describe the delivery phase. Consider the request vector (di, . . . , djc) € W] K , i.e., user k 
requests file Wa k - We focus on a subset S C [K] of |«S| = t + 1 users. Observe that every t users in S 
share a subfile in their caches that is needed at the remaining user in S. More precisely, fix a user k E S, 
and note that |«S \ {k}\ = t. The subfile Wd kt s\{k} is requested by user k since it is a subfile of Wd k . At 
the same time, it is missing at user k since k $L S \ {k}. Finally, it is present in the cache of any user 
seS\{k}. 

For each such subset S C [K] of cardinality \S\ = t + 1, the server transmits 

®sesW ds)S \{s}, 

where © denotes bitwise XOR. Each of these sums results in Fj (v") bits being sent over the shared link. 
Since the number of subsets S is ( J , the total number of bits sent over the shared link is 

pp _ ( k \ F 

c \t+V ^K^ 

_ K{1 - M/N) 
l + KM/N ' 
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where we have used that t = MK/N. 

Example 7. Let N = K = 3 and M = 1. Then t = 1 and the cache placement is Z k = (W n ^k})n=v F° r 
request (di, d 2 , d 3 ) = (1, 2, 3), the signal transmitted in the delivery phase is 

*(1,2,3) = (Wi, {2 } © W 2 ,{1}, W 1>{3} © W 3 , { i h W U3} © W M2} ), 

as we have already seen in Example |5] in Section |IVj Here, the three sums correspond to S = {1,2}, 
S = {3, 1}, and S = {3, 2}, respectively. 

We now argue that each user can successfully recover its requested message. Consider again a subset 
S C [K] with | S | = t + 1. Since each user k E S already has access to the subfiles W dsiS \{ s } for all 
s E S \ {k}, it can solve for Wd kt s\{k} from the signal 

®sesW ds , s \{s} 

sent over the shared link. Since this is true for every such subset S, each receiver k is able to recover all 
subfiles of the form 

{W dktT :Tc[K]\{k},\T\=t} 

of the requested file Wd k - The remaining subfiles are of the form Wd k ,T f° r 7" sucn that k E T. But these 
subfiles are already available in the cache of user k. Hence each user k can recover all subfiles of its 
requested file Wd k . 

This shows that, for M E f • {1, 2, . . . , K - 1}, 

mm < K{1 ~ M,N) 

R (M) " 1 + KM IN 

As was pointed out earlier (see Example @] in Section HVl) . the points on the line connecting any two 
achievable points are also achievable. Together with the upper bound for M E {0, iV} derived earlier, this 
proves Theorem ll.AI Similarly, taking the minimum between the rate derived here and the rate 

min{iV, K}(1 — M/N) 

achieved by the conventional scheme described in Example Q] in Section III-AI proves Theorem ll.Bl ■ 
For completeness, we now formally describe the caching algorithm for iV files, K users, and cache 
size M such that MK/N is a positive integer less than to K. 



procedure Placement^ , . . . , W N ) 

t <- MK/N 

Z<-{Tc[K\:\T\=t} 
for n E [N] do 

split W n into (W n /r : T E T) of equal size 
end for 
for k E [K] do 

Z k <r- {W n , T ■ n E [N},T El,k ET) 
end for 
end procedure 

procedure Delivery(Wi , . . . , W N , d u . . . , d K ) 

t <- MK/N 

& <- {S C [K] : |5| = t + 1} 
X( dl ,...,d K ) <- {®kesW dk ,s\{ k } :SE&) 
end procedure 
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VI. Converse (Proof of Theorem [21) 

Let s G {1, . . . , min{iV, K}} and consider the first s caches Zi, . . . , Z s . There exists an input to the 
shared link, say X\, such that X\ and Z\, . . . , Z s determine the files Wi, . . . , W s . Similarly, there exists an 
input to the shared link, say X 2 , such that X 2 and Z 1 , . . . , Z s determine the files W s+1 , . . . , W 2s . Continue 
in the same manner selecting appropriate X 3 , . . . , X^ N / S j, see Fig. [6] We then have that X\, . . . , X^ N / S ^ 
and Zx,...,Z 8 determine W\, . . . , W s \n/ s \ ■ 



' w 4 \ 



Eh 



1 w 2 

s. _ s 



Zl 



W 3 

N, S 



w 2 



w 4 



z 4 



Fig. 6. Cut corresponding to parameter s — 2 in the proof of the converse. In the figure, N = K = 4, and Xi = ^(1,2,3,4), X2 = ^(4,3,2,1). 



Consider then the cut separating X\, . . . , X\^/ s \ and Zi, . . . ,Z S from the corresponding receivers as 
indicated in Fig. [6] By the cut-set bound ifTOl Theorem 14.10.1], 

s[N/s\ < [N/s\R*(M) + sM. 

Solving for R* and optimizing over all possible choices of s, we obtain 

R*(M)> max (a - , J. , m], 

se{l,...,mm{N,K}} \ [N/sj J 

proving the theorem. ■ 

VII. Approximation of R*(M) (Proof of Theorem [3]) 

We now compare the upper bound on R*(M) in Theorem [2] to the rate R(M) achieved by the proposed 
scheme given by the lower convex envelope of the points 

for M a multiple of N/K as described in Theorem 1. The proof is based on the following observation. 
The function R(M) has three distinct regimes as depicted in Fig. [71 The first regime is for M between 
and approximately max{2, N/K}. In this regime, R(M) is close to linear. The second regime is for 
M approximately between max{2, N/K} and N/2, in which R(M) is nonlinear and behaves essentially 
like N/M. The third regime is for M between approximately N/2 and N, in which R(M) is again close 
to linear. We bound the ratio R k (M)/R(M) separately in each of these regimes — though, to optimize the 
constants, we will choose slightly different definitions of the boundaries for the three regions. 

It will be convenient to consider the two cases min{iV, K} < 12 and min{iV, K} > 13 separately. We 
have 

R{M) < min{iV, K}(1 - M/N) 
by Theorem 1 . On the other hand, setting s = 1 in Theorem [2] yields 

R*(M) > 1 - M/N. 
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min{Af, K} 



mm{N, K}/2 




linear 



nonlinear 



Fig. 7. Achievable rate R(M) (solid curve) as defined in Theorem 1 for N = 20 files and K = 10. The function R(M) has three distinct 
regimes, which are approximately < M < max{2, N/K}, max{2, N/K} < M < N/2, and N/2 < M < N. In the first and third 
regimes, R(M) is essentially linear; in the second regime, R(M) is nonlinear (as indicated by the dotted curve). 



Hence, 

R ^ M \ < miniiV, K} < 12 (2) 

for min{iV, K} < 12. 

Assume in the following that min{iV, K] > 13. We consider the three cases < M < 1.1 max{l, N/K}, 
l.lmax{l, N/K} < M < 0.092iV, and 0.092iV < M < N separately. Assume first that < M < 
l.lmaxjl, N/K}. We have 

R(M) < R(0) < min{iV, K} (3) 
by Theorem 1. By Theorem [2l and using that [N/s\ > N/s — 1, 

R*(M)>s **L. 

V ; ~ 1-s/NN 

Setting s= [0.275 min{iV, K}\ G {1, . . . , min{iV, K}}, we obtain for M < 1.1 max{l, N/K} 
R*(M) > R* (1.1 max{l, N/K}) 

> [0.275 min{ iV. A -}J - ao^min^.A-}])* 1.1m,(1, W 
~ L 1- [0.275 mm{N,K}\/N N 

( 1 1 1-0 275 2 

> min{iV, K} (0.275 - — ^ - x _ a2?5 min{1 ^ /iV} 

>min{iV,^}f 0.275 ' 11 (, ' 27V 



13 1 - 0.275 

>^min{iV,K}, (4) 
where we have used the assumption that min{A r , K} > 13. Combining © and © yields 

4^ < 12 (5) 
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for < M < 1.1 max{l, N/K} and min{iV, K} > 13. 

Assume then that 1.1 max{l, N/K} < M < 0.092iV. Let M be the largest multiple of N/K less than 
or equal to M, so that 

< M - N/K <M<M. 



Then, by Theorem 1, 



R{M) < R(M) 

K(l - M/N) 



< 



< 



1 + KM /N 
K(l - M/N+l/K) 



1 + KM/N - 1 
< N/M, (6) 

where we have used that M/N > l/K'm the last inequality. Setting s = [0.3N/M\ e {1, . . . , min{A^, K}} 
in Theorem [2l we obtain 

s 2 

R*(M) > s M 

v ' ~ N - s 

0.3N 0.3 2 N 2 /M 2 
~ M ~ 1 ~ N - 0.3N/M 
N /„ „ 0.3 2 



> — (0.3-0.092 



Combining © and © yields 



M\ 1-0.3/1.1 

> (7) 
- 12M v ' 

R(M) 

for 1.1 max{l, N/K} < M < 0.092A^. 

Finally, assume 0.092A^ < M < N. Let M be the largest multiple of N/K less than or equal to 
0.092iV, so that 

< 0.092iV - N/K <M< 0.092iV. 
Then, using convexity of R(-) and that R(N) = 0, 

R(M) < R ^ (1 - M/N) 
V ; _ 1 — M/N ' ' 

< R(M)(1 - M/N), 

~ 1 - 0.092 v A 11 

where we have used that M < 0.092A^. Now, by Theorem 1, 

R(M) < 1 



< 



M/N + \/K 
1 

0.092 - 1/K + l/K 
1 

0.092' 
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Hence, 



R(M) < ; -(1 - M/N) 

v ' ~ 0.092(1 - 0.092) v 1 ' 



< 12(1 -M/N). (9) 

Setting s = 1 in Theorem [2l we obtain 

R*(M) > 1 - M/N. (10) 

Combining © and £C1 yields 

^<12 (11) 
R*(M) ~ 

for 0.092iV < M < N and min{iV, if} > 13. 
Combining ©, ©, ®, and CCD yields 

^£I<12 
R*(M) ~ 

for all JV, K and all < M < N. ■ 



Appendix 

Example |4] in Section |IV] derives the upper and lower bounds on the memory-rate tradeoff R*(M) 
claimed in Theorems 1 and [2] for K = N = 2. While these two bounds are sufficient to characterize 
R*(M) to within a bounded multiplicative gap, as we show in what follows, neither the achievable region 
nor the converse are tight in general. 

We start by arguing that the achievable scheme can be improved by focusing on the case M = 1/2. As 
before, we split each file into two subfiles, i.e., A = (Ai, A 2 ) and B = (S 1; B 2 ). In the placement phase, 
we choose the cache contents as Z\ = A\ © B\ and Z 2 = A 2 © B 2 . Assume that user one requests file 
A and user two requests file B. The server can satisfy these requests by transmitting X(i >2 ) = (Bi,A 2 ) 
at rate R = 1. The other three possible requests can be satisfied in a similar manner. This shows that 
(M, R) pair (1/2, 1) is achievable. This new point improves the boundary of the achievable region from 
the solid blue to the dotted black curve in Fig. [3] in Section [TV] 

We now argue that the converse can also be improved. Referring to Fig. |2(b)| in Section UH we have 
for any achievable memory-rate pair (M,R), 

2RF + 2MF > H{X {1>2) , Z x ) + H{X {2>1) ,Z 2 ) 

= H(X (1 $,Z 1 \W 1 ) + H{X m ,Z 2 \W x ) + I(W 1 ; X (lj2) , Z x ) + I(W r , X (2>1) , Z 2 ) 

> H(x {1>2) , z u x (2jl) , z 2 \w x ) + i(w i; x (lj2) , z{j + i(w r , x (2jl) , z 2 ) 

> I(W 2 ; x (1>2) , z u x (2>lh Z 2 \W X ) + I(Wv, Xw,Z x ) + I(W X ; X {2 ,i),Z 2 ). 

Now, since W\ can be decoded from (X(i^), Z x ) and also from (Xqi), Z 2 ), and since W 2 can be decoded 
from (X( X >2 ), Z x , X(2 t i), Z 2 ), we obtain using Fano's inequality that 

I(W x ;X {1>2) ,Z x )>F-eF, 
I{W x ;X {2>x) ,Z 2 )>F-eF, 
I(W 2 ; X (1>2) , Z x , X (2 ,i), Z 2 \W X ) > F — eF 

for any e > as long as F is large enough. Hence, 

2RF + 2MF > 3F - 3eF. 
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Since e > is arbitrary, this shows that 

R > 3/2 - 3M/2. 
Since this is true for any achievable (M, R) pair, this implies that 

R*(M) > 3/2 - 3M/2. 

This bound, together with the two cut-set bounds derived earlier, proves that the dotted black curve 
depicted in Fig. [3] in Section HVl is indeed equal to R*{M). 
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